Video processing method, computer device, and storage medium

ABSTRACT

A video processing method is provided. In the method, circuitry of a terminal determines a second video frame portion associated with a second time point that is after a first time point associated with a first video frame portion of a video. The circuitry generates a transitional video frame based on the first video frame portion and the second video frame portion. A color value of a pixel at a target pixel location in the transitional video frame is within a target color interval. The target color interval is determined according to a color value of a pixel at the target pixel location in the first video frame portion and a color value of a pixel at the target pixel location in the second video frame portion. The circuitry performs display control of the video according to the transitional video frame.

RELATED APPLICATION

This application is a continuation of International Application No.PCT/CN2018/111971, filed on Oct. 25, 2018, which claims priority toChinese Patent Application No. 201711021898.0, entitled “VIDEOPROCESSING METHOD AND APPARATUS, INTELLIGENT TERMINAL, AND STORAGEMEDIUM” filed on Oct. 27, 2017. The entire disclosures of the priorapplications are hereby incorporated by reference in their entirety.

FIELD OF THE TECHNOLOGY

This application relates to the field of video technologies.

BACKGROUND OF THE DISCLOSURE

A panoramic video is a video that can be obtained by all-roundphotographing according to requirements, or a video obtained by imagemaking, and can present video content in 360 degrees. When watching apanoramic video, a user may freely perform upward, downward, leftward,and rightward play control on the panoramic video according torequirements. Panoramic videos are mostly applied to scenarios such astourism exhibition, venues, or city introduction.

A virtual reality (VR) device is one device that can play a panoramicvideo. The VR device may be worn on a user's head, and the user maydirectly rotate the head to implement upward, downward, leftward, andrightward play control on the panoramic video, to watch image contentdesired by the user.

How to perform play control on the panoramic video to meet requirementsof the user has become a hotspot of study.

SUMMARY

According to various embodiments provided in this application, a videoprocessing method, a computer device, and a non-transitorycomputer-readable storage medium are provided. According to an aspect, avideo processing method is provided. In the method, circuitry of aterminal determines a second video frame portion associated with asecond time point that is after a first time point associated with afirst video frame portion of a video. A transitional video frame isgenerated by the circuitry based on the first video frame portion andthe second video frame portion. A color value of a pixel at a targetpixel location in the transitional video frame is within a target colorinterval. The target color interval is determined according to a colorvalue of a pixel at the target pixel location in the first video frameportion and a color value of a pixel at the target pixel location in thesecond video frame portion;. Display control of the video is performedby the circuitry according to the transitional video frame.

According to an aspect, motion speed information is obtained by thecircuitry. Second pose information of the second time point is obtainedby the circuitry according to first pose information of the first timepoint, the motion speed information, and a difference between the firsttime point and the second time point. A second video frame of the videoto be displayed at the second time point is obtained by the circuitry.The second video frame portion is obtained by the circuitry from thesecond video frame according to the second pose information.

According to an aspect, a width of the second video frame portionobtained from the second video frame is greater than a width of a videoimage that is displayed. The displayed video image is a partial image ofthe second video frame.

According to an aspect, a height of the second video frame portionobtained from the second video frame is greater than a height of thedisplayed video image.

According to an aspect, the motion speed information indicates rotationangular velocity values in at least two different motion directions. Thesecond pose information is visual field orientation angle information. Aplurality of different motion directions is associated with initialvideo frame groups, and each initial video frame group includes acandidate second video frame portion and a candidate transitional videoframe generated based on the second video frame portion.

According to an aspect, a video frame refresh frequency of the terminalis obtained by the circuitry. A video frame play frequency duringplaying of the video is obtained by the circuitry. A number oftransitional video frames is obtained by the circuitry according to thevideo frame refresh frequency and the video frame play frequency. Thedetermined number of transitional video frames are generated.

According to an aspect, a first color value of the pixel of the targetpixel location in the first video frame portion is obtained by thecircuitry. A second color value of the pixel of the target pixellocation in the second video frame portion is obtained by the circuitry.The color value of the transitional pixel at the target pixel locationis obtained by the circuitry based on an interpolation according to thefirst color value and the second color value;. The transitional videoframe is generated by the circuitry according to pixel values of aplurality of transitional pixels, which includes the transitional pixel.

According to an aspect, differential calculation is performed by thecircuitry between color channel components in the first color value andcolor channel components in the second color value, to obtain componentdifferences of the color channel components. Change amounts of the colorchannel components are obtained by the circuitry according to a numberof transitional video frames that need to be generated. The color valueof the transitional pixel of the target pixel location of each of thenumber of transitional video frames is obtained by the circuitryaccording to the color channel components of the first color value, thenumber of the transitional video frames that need to be generated, andthe change amounts of the color channel components.

According to an aspect, the transitional video frame is generated basedon all pixels of the first video frame portion and all pixels of thesecond video frame portion.

According to an aspect, the color value of the pixel at the target pixellocation in the first video frame portion is one of black and white, andthe color value of the pixel at the target pixel location in the secondvideo frame portion is the other one of black and white.

According to an aspect, a motion direction is detected by the circuitry.An initial video frame group associated with the motion direction isselected by the circuitry, to obtain a target initial video frame group.The display control of the video is performed by the circuitry accordingto the target initial video frame group.

According to an aspect, a visual field angle parameter of the terminalis obtained by the circuitry. A pixel range to be display is determinedby the circuitry according to the visual field angle parameter. Pixelcoordinate matching is performed by the circuitry on each video frameportion in the target initial video frame group according to the pixelrange, to form a transitional play video frame and a second play videoframe. The transitional play video frame and the second play video frameare sequentially played by the circuitry.

At least one non-transitory computer-readable storage medium storinginstructions which when executed by one or more processors cause the oneor more processors to perform any of the image processing methods.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1a is a schematic diagram of a scenario of playing a panoramicvideo according to an embodiment of this application.

FIG. 1b is a schematic flowchart of determining a visual plane in apanoramic video according to an embodiment of this application.

FIG. 2a is a schematic diagram of pose information in differentdirections according to an embodiment of this application.

FIG. 2b is a schematic diagram of an image area in a panoramic videoframe in different visual field orientation angles according to anembodiment of this application.

FIG. 3 is a schematic diagram of a play video frame played for a user ina panoramic video playing process according to an embodiment of thisapplication.

FIG. 4 is a schematic flowchart of a method for determining a colorvalue of each transitional initial video frame according to anembodiment of this application.

FIG. 5 is a schematic flowchart of a video processing method accordingto an embodiment of this application.

FIG. 6 is a schematic flowchart of another video processing methodaccording to an embodiment of this application.

FIG. 7 is a schematic structural diagram of a video processing apparatusaccording to an embodiment of this application.

FIG. 8 is a schematic structural diagram of an intelligent terminalaccording to an embodiment of this application.

FIG. 9 is a diagram of an internal structure of an intelligent terminalaccording to an embodiment of this application.

DESCRIPTION OF EMBODIMENTS

To make the objectives, the technical solutions, and the advantages ofthis application clearer, the following further describes thisapplication in detail with reference to the accompanying drawings andthe embodiments. It should be understood that the specific embodimentsdescribed herein are only used to describe this application, instead oflimiting this application.

In an embodiment of this application, when a panoramic video (e.g., apartially or fully immersive video) is played, play control may beperformed in a plurality of motion directions according to requirements,so that a video frame that can be played and displayed to a user can beobtained through adjustment in a panoramic video frame. The panoramicvideo may be played by using a VR device. When the panoramic video isplayed by using the VR device, the user may rotate a head in up, down,left, and right directions, to watch video images at differentlocations. As shown in FIG. 1a , an image corresponding to an image area101 is displayed on a display interface in a first visual fieldorientation angle, and an image corresponding to an image area 102 isdisplayed on a display interface in a second visual field orientationangle. In other embodiments, the panoramic video may also be played byusing a dedicated panoramic video playing device. A description is madeby using a VR device as an example.

In a process in which the user wears the VR device to play the panoramicvideo, almost all light sources are emitted by the VR device. Thebrightness change may cause a problem of uncomfortable watching. In thisembodiment of this application, play control may be performed on thepanoramic video according to brightness of an image in a played videoframe, and after the control, a degree of the brightness change betweenvideo frames played to the user is reduced. When watching the panoramicvideo played by the VR device, two eyes of a person are fully immersedin a playing environment of the panoramic video. When the panoramicvideo frame is played in the VR device, brightness of emitted light mayfiercely change, or brightness of a part of an area in a large-areablock in the panoramic video frame is relatively high. In the cases, theforegoing problem of uncomfortable watching caused by the brightnesschange may occur.

In this embodiment of this application, to control playing of thepanoramic video, interpolation correction is mainly performed on a pixelin a corresponding location area in a panoramic video frame that may beplayed, so that brightness of a video pixel seen by the user can performgradient transition, and there is the maximum amount of time to makeeyeballs adapted, to further reduce stimulation of the VR device on eyesof the user. In an embodiment, a watching location of the user may bepredicted. For example, a visual field orientation angle after the userrotates the head is predicted, and play control is performed on thepanoramic video according to the predicted watching location. In anembodiment, a prediction process mainly includes: directly predictingwatching locations in 5 directions within a next unit of time, forexample, 1/30 second, of a user when the user moves upward, downward,leftward, rightward, or keeps the head still. Then video frameprocessing is performed on the panoramic video according to theprediction result and the panoramic video frame to be displayed to theuser in the next unit of time. The video frame processing includesprocessing on a transitional initial video frame.

As shown in FIG. 1b , to display images of an image area 101 and animage area 102 of a panoramic image on a display interface, a location(x,y,z) of the user in a three-dimensional space needs to be obtained inS101, and visual field orientation angle information (α,β, γ) of theuser is obtained in S102. (α,β, γ) is a coordinate point under aspherical coordinate, γ may be 1, and different visual fieldorientations are determined based on different α and β. In thisembodiment of this application, the panoramic video is played by usingthe VR device. The VR device includes a head mounted display (HMD)apparatus. A screen in the HMD apparatus is used to provide a displayinterface to display a corresponding video frame in real time. Inaddition, eyeballs of the user can also be tracked in the HMD apparatus,and real-time motion data of the eyeballs can be obtained, so thatvisual field orientation angle information is determined according tothe motion data of the eyeballs. The VR device may further include atracking system formed by a motion sensor. The tracking system can tracklocation information and motion information of the user in a realthree-dimensional space, to determine the visual field orientation angleinformation according to the location information and the motioninformation. In an embodiment, eyeballs or the head may move, and visualfield orientation angle information at a corresponding time point may becomprehensively determined according to a motion trajectory of theeyeballs and a motion trajectory of the head. The VR device includes: acalculation processing unit that can obtain real-time location andmotion information of the user from the tracking system and/or obtaineyeball motion data obtained by tracking the eyeballs, and calculate athree-dimensional coordinate of the user head in a virtualthree-dimensional space, and with reference to the eyeball motion dataobtained by tracking the eyeballs, calculate a visual field orientationof the user in the virtual three-dimensional space, and athree-dimensional coordinate of a real-time point of attention of theuser. After the visual field orientation angle information of the useris determined, a visual plane of the user on a panoramic video framethat needs to be played may be determined in S103. An area correspondingto the visual plane is used as an image area, and a play video framethat can be displayed on the display interface is generated according toan image in the image area. In an embodiment, in a process ofdetermining a visual plane in S103, a visual plane of vision persistenceof the user may be obtained based on a series of experience data, suchas an optimal visual range d (for example, an experience value may be2.5 m), a field of view (FOV) (for example, an experience value may be90°) indicating a visual field width and based on a location (x,y,z) ofthe user head and a visual field orientation (α,β, γ). The length andthe width of the visual plane and a spatial three-dimensional coordinateof each point on the visual plane can be specifically obtained.

In this embodiment of this application, FIG. 2a is a schematic diagramof pose information when a next unit time point T+1 (may be consideredas a second time point) is predicted at a current time point T (may beconsidered as a first time point). The pose information involved in FIG.2a mainly refers to visual field orientation angle information of theuser. Pose information of the second time point in other embodiments mayrefer to displacement information of the user head, for example,leftward displacement or rightward displacement. In FIG. 2a , poseinformation of the first time point is initial visual field orientationangle information of the user, and is (α,β, γ). According to therotation angle speed*the unit time of the user, it is obtained thatpossible viewing angles of the user are upward (α1, β1, γ1), downward(α2, β2, γ2), leftward (α3, β3, γ3), and rightward (α4, β4, γ4), andcertainly, it is possible that the visual field orientation does notchange, and is still (α,β, γ). Therefore, at the second time point, theviewing angle orientation angle information is: (α,β, γ), (α1, β1, γ1),(α2, β2, γ2), (α3, β3, γ3), and (α4, β4, γ4). The rotation angle speedof the user may be an experience value, and may be considered as arotation angle speed when the user rotates the head to watch thepanoramic video in a normal case. In other embodiments, visual fieldorientations in different directions may be actually obtained based ondifferent α and β. For example, an oblique upward visual fieldorientation (α5, β5, γ5) or the like may also be obtained.

In FIG. 2b , in different visual field orientation angles, some imagesin different panoramic video frames can be seen. In this embodiment ofthis application, at the second time point, the upward visual fieldorientation angle information (α1, β1, γ1) can correspond to the secondinitial video frame 201 a and the second play video frame 201 b, and thedownward visual field orientation angle information (α2, β2, γ2) cancorrespond to the second initial video frame 202 a and the second playvideo frame 202 b; the leftward visual field orientation angleinformation (α3, β3, γ3) can correspond to the second initial videoframe 203 a and the second play video frame 203 b; and the rightwardvisual field orientation angle information (α4, β4, γ4) can correspondto the second initial video frame 204 a and the second play video frame204 b. In an embodiment, if the user head does not rotate, a new imagealso appears in the middle image area at the second time point, to forma new play video frame. The current initial visual field orientationangle information at the first time point corresponds to the first playvideo frame 200 a and the first initial video frame 200 b.

In this embodiment of this application, interpolation transitioningneeds to be performed between the currently played play video frame andthe four new play video frames that may exist, to obtain one or moretransitional initial video frames and the second initial video frame atthe second time point, so that after an actual rotation direction of theuser is subsequently determined, the transitional play video frame andthe second play video frame that can be projected onto the displayinterface of the VR device for playing to the user can be obtainedaccording to each transitional initial video frame and the secondinitial video frame.

In this embodiment of this application, a video frame obtained throughinitial processing is referred to as an initial video frame, and a videoframe that can be finally projected onto the display interface andplayed to the user is referred to as a play video frame. Each initialvideo frame differs from the play video frame in that: the frame heightof the initial video frame is greater than that of the play video frame,and/or, the frame width of the initial video frame is greater than thatof the play video frame. In an embodiment, the frame width and the frameheight of the initial video frame are correspondingly 1.4 times of theframe width and the width height of the final play video frame. The sizeof each initial video frame is greater than that of each play videoframe, so that it can be ensured that a video pixel range seen by theuser after the user actually rotates is within a pixel range of thetransitional video frame, to ensure that each transitional video framefinally played to the user can be normally played. Certainly, in otherembodiments, the size of each initial video frame may also be the sameas that of the final play video frame.

In an embodiment, a number is determined according to a video frame playfrequency when the panoramic video is normally played and a video framerefresh frequency of the VR device. The number is the number oftransitional initial video frames between the first initial video frameand the second initial video frame. In an embodiment, if the video frameplay frequency when the panoramic video is played is 30 Hz, and the VRdisplay screen refresh frequency is 60 HZ, one transitional initialvideo frame can be determined between every two video frames. If the VRdisplay screen refresh frequency is 90 HZ, two transitional initialvideo frames can be determined between every two video frames. If the VRdisplay screen refresh frequency is 120 HZ, three transitional initialvideo frames can be determined between every two video frames. FIG. 3 isa schematic diagram of a play video frame finally displayed to the userwhen there are three transitional initial video frames. The play videoframes include a first play video frame 301 corresponding to the firstinitial video frame, three transitional initial video frames 302, andthe second initial video frame 303.

After the number of the transitional initial video frames is determined,an interpolation between color values of pixels at a same locationbetween the first initial video frame and the second initial video framemay be obtained based on a preset interpolation rule, to determine acolor value of the transitional pixel according to the interpolation andfurther generate each transitional initial video frame. Theinterpolation rule may be set according to requirements. In anembodiment, the color value that is of the pixel of the target locationin the transitional initial video frame and that is obtained accordingto the set interpolation rule is within a target interval. The targetinterval is a color value interval determined according to the colorvalue of the pixel of the target location in the first initial videoframe and the color value of the pixel of the target location in thesecond initial video frame. In an embodiment, if a plurality oftransitional initial video frames is included, not only it is ensuredthat the color value of the pixel of the target location in thetransitional initial video frame is within the target interval, but alsoit needs to be further ensured that a change amplitude of color valuesof pixels at the target location in two adjacent transitional initialvideo frames is less than a preset amplitude threshold, to ensure thatthe brightness degree of the image can perform gradient transition.

In an embodiment, FIG. 4 is a schematic flowchart of a method fordetermining a color value of a pixel at a target location in eachtransitional initial video frame according to an interpolation ruleaccording to an embodiment of this application. The target location ofthe first initial video frame is a pixel location coordinate, and thetarget location in the second initial video frame is also a pixellocation coordinate. The two target locations are located on differentimages but the image locations are the same. Processing shown in FIG. 4may be performed on each pixel location in the first initial video frameand the second initial video frame as the target location. The firstinitial video frame at the first time point and the second initial videoframe at the second time point are RGB images. Components R1, G1, and B1of an RGB channel of the target location in the first initial videoframe at the first time point are obtained in S401, and components R2,G2, and B2 of an RGB channel of the target location in the secondinitial video frame at the second time point are obtained in S402. Acomponent difference of the channel is calculated in S403. Thecalculation manner may be DR=R2-R1, DG=G2-G1, DB=B2-B1, and thecomponent difference is an absolute value. The change amount of thecolor channel component of the pixel at the target location of eachtransitional initial video frame is obtained according to the determinednumber of transitional initial video frames in S404. The calculationmanner may be DTR=DR/N, DTG=DG/N, and DTB=DB/N. In S405, the color valueof the transitional pixel of the target location of each transitionalinitial video frame is obtained according to each color channelcomponent of the pixel at the target location of the first initial videoframe and the change amount of each color channel component. Thecalculation manner may be: RC1=R1+n*DTR, GC2=G2+n*DTG, and BC3=B3+n*DTB,where n is the number of the transitional initial video frame; if thereare three transitional initial video frames, n is sequentially equal to1, 2, and 3; if there are two frames, n is sequentially equal to 1 and2; if there is only one frame, n=1. Each color value component of thenth transitional initial video frame obtained after S404 is (RC1, GC2,BC3), and a color value of a pixel of a target location in the nthtransitional initial video frame is obtained according to the colorvalue component.

The intermediate transitional initial video frame is determinedaccording to color values of pixels of all locations in the firstinitial video frame and the second initial video frame. In anembodiment, only locations of pixels in the target areas in the firstinitial video frame and the second initial video frame may also beseparately used as the target location. In an embodiment, the targetarea may include: the target area in the first initial video frame iswhite, and the target area in the second initial video frame is black;or, the target area in the first initial video frame is black, and thetarget area in the second initial video frame is white. That is, theforegoing differential processing may be performed only on a jumpingpart from black to white or from white to black in the video frame.Color value interpolation is not performed on pixels of areas other thanone or more target areas. Color values of pixels of other areas may bethe same as the color value of the pixel of the corresponding locationof the first initial video frame, or the same as the color value of thepixel of the corresponding location of the second initial video frame.In an embodiment, the target area may further refer to: in the firstinitial video frame and the second initial video frame, if an absolutevalue of a difference between color values of pixels in the target areais greater than a preset difference threshold, it can be considered thata difference between color values of the pixels at the same location isrelatively large. Similarly, the color values of the pixels outside thetarget area may be the same as the color value of the pixel at the samelocation in the first initial video frame, or may be the same as thecolor value of the pixel at the same location in the second initialvideo frame.

RGB channel components of black and white pixels may be user-defined. Inan embodiment, a series of non-pure-black black in different colorvalues (0, 0, 0) to (1, 1, 1) in RGB may also be defined as black.Reference may be made to the following Table 1 for exemplary definedvalues. As shown in Table 1, when it is detected that the color value ofthe pixel of the target location in the first initial video frame at thefirst time point is #1C1C1C (grey11), and the color value of the pixelat the target location in the second initial video frame at the secondtime point is #828282 (grey51), it may be considered that a differencebetween color values of them is relatively large. If another targetlocation relatively near the target location also has a relatively largecolor value difference, it may be considered that an area formed by thetarget location and the other target location is the target area, andcolor value interpolation needs to be performed based on the targetarea, to form a transitional initial video frame.

TABLE 1 Identifier Color component Color value grey11 28 28 28 #1C1C1Cgrey21 54 54 54 #363636 grey31 79 79 79 #4F4F4F grey41 105 105 105#696969 grey51 130 130 130 #828282 grey61 156 156 156 #9C9C9C grey71 181181 181 #B5B5B5 grey81 207 207 207 #CFCFCF grey91 232 232 232 #E8E8E8

After the color value at each target location is obtained or when it isdetected that the user moves towards a direction, an actual visual fieldorientation angle of the user is obtained, and a corresponding targetinitial video frame group is determined according to the actual visualfield orientation angle. The target initial video frame group includes asecond initial video frame that is in a direction and that is obtainedthrough estimation and a transitional initial video frame. For example,if the user rotates leftward to (α3, β3, γ3), the leftward secondinitial video frame and a transitional initial video framecorrespondingly obtained according to the leftward second initial videoframe are selected. The actual visual field orientation angle of theuser may be detected based on a motion sensor on the VR device.

After the target initial video frame group is obtained, each initialvideo frame in the target initial video frame group needs to beprojected onto a display interface of the VR device. A pixel range to beprojected onto the display interface of the VR device may be obtainedaccording to a field of view (FOV) of the VR device. Pixel coordinatematching is performed on each initial video frame in the target initialvideo frame group according to the pixel range, to select pixels thatneed to be projected, to form an actual transitional play video play anda second play video frame, and then the video frames are sequentiallyplayed on the VR display interface according to a time order. The FOV ofthe VR device is an inherent parameter of the VR device.

In this embodiment of this application, a second time point can bedetermined through prediction, and some transitional video frames aregenerated according to the video frame of the second time point and thevideo frame of the current first time point, so that it can be ensuredthat a degree of a pixel color change between the video frame played atthe first time point and the video frame played at the second time pointis relatively small, and bad stimulation caused to eyes of the userduring playing of the panoramic video can be effectively reduced, and inthe entire process, the user does not need to manually adjust theplaying brightness of devices such as a VR device, to meet therequirements of automation and intelligentization of the user.

FIG. 5 is a schematic flowchart of a video processing method accordingto an embodiment of this application. The method in this embodiment ofthis application may be performed by a terminal device configured toplay a panoramic video. In an embodiment, the method may be performed bya VR device that can play a panoramic video. The method in thisembodiment of this application includes the following steps.

In S501, a second initial video frame of a second time point isdetermined in response to playing a panoramic video, the second timepoint being after a first time point. There may be a difference of oneunit of time between the first time point and the second time point. Theunit of time may be determined according to a video frame play frequencywhen the panoramic video is played. For example, when the video frameplay frequency is 30 Hz, the unit of time is 1/30 second.

In an embodiment, there may be one or more second initial video framescorresponding to the second time point. If a motion direction of theuser is learned in advance, the second initial video frame is a videoframe formed by a partial image in a visual field orientation anglecorresponding to the motion direction in a panoramic video frame playedat the second time point. For example, as shown in FIG. 2a , if it islearned in advance that the user rotates leftward, it may be consideredthat in the panoramic video played at the second time point, thecorresponding second initial video frame can be obtained according to apartial image that can be watched in the (α3, β3, γ3) visual fieldorientation angle corresponding to the leftward motion direction.Similarly, if the motion direction of the user is not learned, theleftward, rightward, upward, downward, and still cases in FIG. 2a may beused as a predicted motion direction, and then five second initial videoframes are determined.

In S502, a transitional initial video frame is generated from a firstinitial video frame of the first time point to a second initial videoframe, a color value of a pixel of a target location in the transitionalinitial video frame being located within a target interval, and thetarget interval being a color value interval determined according to acolor value of a pixel of the target location in the first initial videoframe and a color value of pixel of the target location in the secondinitial video frame. That is, the transitional initial video frame isgenerated between the first time point and the second time point. Thevideo frames make a change in amplitude of the pixel at each targetlocation relatively small. In particular, between the first time pointand the second time point, when a pixel at a target location jumps fromblack to white, based on the transitional initial video frame, thetarget location can gradually change from black to white between thefirst time point and the second time point.

In S503, play control is performed according to the transitional initialvideo frame, to complete playing the panoramic video. The performingplay control according to the transitional initial video frame mainlyincludes processing the transitional initial video frame and the secondinitial video frame, to obtain a transitional play video frame and asecond play video frame that can be actually projected onto the displayinterface of the VR device, and the video frames are played to the user.In an embodiment, a pixel range of the transitional initial video frameand the second initial video frame may be determined based on the FOV ofthe VR device, and then the transitional play video frame and the secondplay video frame are extracted and determined, and the transitional playvideo frames are sequentially played according to a time order betweenthe first time point and the second time point, and the second playvideo frame is finally played.

In an embodiment, the transitional initial video frame generated in S502may be transitional initial video frame data, and an image can begenerated by using the transitional initial video frame data and theimage is displayed on the display interface for the user to watch.

Reference may be made to descriptions of the foregoing content forimplementation of steps in the method in this embodiment of thisapplication.

FIG. 6 is a schematic flowchart of another video processing methodaccording to an embodiment of this application. The method in thisembodiment of this application may be performed by a terminal deviceconfigured to play a panoramic video. In an embodiment, the method maybe performed by a VR device that can play a panoramic video. The methodin this embodiment of this application includes the following steps.

In S601, a motion speed value is obtained. The motion speed value may bean estimated value, and the motion speed value may be dynamicallydetermined according to an angular velocity of a user's head rotation ina process of wearing a VR device to watch the panoramic video in aperiod of time, for example, an average angular velocity within a periodof time.

In S602, second pose information is obtained at the second time pointaccording to first pose information of the first time point, the motionspeed value, and a duration between the first time point and the secondtime point. In an embodiment, a pose change amount, such as an anglechange amount, from the first time point to the second time point may becalculated according to the motion speed value and the duration betweenthe first time point and the second time point, so that second poseinformation is obtained. Because the motion speed value is an estimatedvalue, the second pose information is also an estimated value, and isnot pose information after actual rotation of the user. To facilitatethe subsequent process, the transitional initial video frame iscalculated in advance, so that the transitional play video framecorresponding to each transitional initial video frame is played beforethe second play video frame of the second time point is finally played.

In S603, a panoramic video frame of the panoramic video to be played atthe second time point is obtained. After recording of the panoramicvideo is completed, the panoramic video frame at each time point isknown, and the panoramic video frame can be obtained based on the timepoint, to facilitate subsequent interception of a partial image from thepanoramic video frame, to form the second initial video frame and thesecond play video frame that can be played and displayed on the displayinterface.

In S604, the second initial video frame from the panoramic video frameaccording to the second pose information. S601 to S604 correspond toS501 in the previous embodiment. The second pose information is a pieceof visual field orientation angle information. A partial image can befound and intercepted from the panoramic video frame according to thevisual field orientation angle of the user, to form the second initialvideo frame, and reference may be made to FIG. 3.

In an embodiment, a frame width of the second initial video frameobtained from the panoramic video frame is greater than a frame width ofa play video frame that can be actually played and displayed on adisplay interface; and/or a frame height of the second initial videoframe obtained from the panoramic video frame is greater than a frameheight of the play video frame that can be actually played and displayedon the display interface; the play video frame that can be played anddisplayed on the display interface is a video frame formed by a partialimage that is in the panoramic video frame and that is projected ontothe terminal display interface in the second pose information. Alsoreferring to FIG. 2b , the size of the second initial video frame isgreater than the size of the second play video frame that can beactually displayed on the display interface, so that it can be ensuredthat the transitional play video frame between the first time point andthe second time point can be accurately obtained, and can be normallyplayed.

In an embodiment, the motion speed value includes rotation angularvelocity values in at least two different directions, the second poseinformation is visual field orientation angle information; eachdirection is associated with an initial video frame group, and eachinitial video frame group includes a second initial video frame and atransitional initial video frame obtained according to the secondinitial video frame.

In S605, a transitional initial video frame is generated from a firstinitial video frame of the first time point to a second initial videoframe, a color value of a pixel of a target location in the transitionalinitial video frame being located within a target interval, and thetarget interval being a color value interval determined according to acolor value of a pixel of the target location in the first initial videoframe and a color value of pixel of the target location in the secondinitial video frame.

In an embodiment, S605 may include: obtaining a first color value of thepixel of the target location in the first initial video frame of thefirst time point; obtaining a second color value of the pixel of thetarget location in the second initial video frame; performinginterpolation according to the first color value and the second colorvalue, to obtain the color value of the transitional pixel of the targetlocation of the transitional initial video frame; and after obtainingcolor values of a plurality of transitional pixels, generating thetransitional initial video frame according to the obtained pixel valuesof all the transitional pixels.

In an embodiment, the performing interpolation according to the firstcolor value and the second color value, to obtain the color value of thetransitional pixel of the target location of the transitional initialvideo frame includes: performing differential calculation on a colorchannel component in the first color value and a color channel componentin the second color value, to obtain a component difference of eachcolor channel component; obtaining a change amount of each color channelcomponent according to a number of transitional initial video framesthat need to be generated; and obtaining the color value of thetransitional pixel of the target location of each transitional initialvideo frame according to each color channel component of the first colorvalue, the number of the transitional initial video frames that need tobe generated, and the change amount of each color channel component. Inan embodiment, reference may be made to the descriptions of theembodiment corresponding to FIG. 4 for a calculation process of thecolor value of the transitional pixel.

In an embodiment, pixel locations of all pixels of the first initialvideo frame and the second initial video frame are separately used asthe target location; or, the color value of the pixel of the targetlocation in the first initial video frame is black, and the color valueof the pixel of the target location in the second initial video frame iswhite; or the color value of the pixel of the target location in thefirst initial video frame is white, and the color value of the pixel ofthe target location in the second initial video frame is black.

In an embodiment, the number of transitional initial video frames may bedetermined, to perform interpolation based on a number. The determiningthe number of transitional initial video frames between the first timepoint and the second time point includes: obtaining a video framerefresh frequency of the terminal; obtaining a video frame playfrequency when the panoramic video is played; determining a number oftransitional initial video frames according to the video frame refreshfrequency and the video frame play frequency, a number of the generatedtransitional initial video frames being the same as the determinednumber. In an embodiment, if the video frame play frequency when thepanoramic video is played is 30 Hz, and the VR display screen refreshfrequency is 60 HZ, one transitional initial video frame can bedetermined between every two video frames. If the VR display screenrefresh frequency is 90 HZ, two transitional initial video frames can bedetermined between every two video frames. If the VR display screenrefresh frequency is 120 HZ, three transitional initial video frames canbe determined between every two video frames.

In S606, a motion direction is detected. The motion direction of the VRdevice may be detected according to the sensor on the VR device. Forexample, it may be detected by using a gyroscope whether the VR devicerotates upward, downward, leftward, rightward, or does not move.

In S607, an initial video frame group associated with the motiondirection is selected, to obtain a target initial video frame group. Forexample, after it is determined that the user head rotates leftward, thesecond play video frame 203 a shown in FIG. 2b and each transitionalinitial video frame calculated based on the second play video frame 203a may be selected, to obtain the target initial video frame group.

In S608, play control is performed according to the target initial videoframe group. In an embodiment, the second play video frame and eachtransitional play video frame that can be displayed on the displayinterface of the VR device may be generated according to the sizes ofthe second initial video frame and each transitional initial video framein the initial video frame group, and each second play video frame andeach transitional play video frame are played on the display interface.

In an embodiment, S608 may specifically include: obtaining a visualfield angle parameter of the terminal; determining, according to thevisual field angle parameter, a pixel range projected onto the terminaldisplay interface; performing pixel coordinate matching on each initialvideo frame in the target initial video frame group according to thepixel range, to select pixels that need to be projected to form atransitional play video frame and a second play video frame; andsequentially playing the transitional play video frame and the secondplay video frame according to a time order.

For exemplary implementations of steps in the method according to thisembodiment of this application, refer to the description of relatedcontent in the foregoing embodiments.

In this embodiment of this application, a second time point can bedetermined through prediction, and some transitional video frames aregenerated according to the video frame of the second time point and thevideo frame of the current first time point, so that it can be ensuredthat a degree of a pixel color change between the video frame played atthe first time point and the video frame played at the second time pointis relatively small, and bad stimulation caused to eyes of the userduring playing of the panoramic video can be effectively reduced, and inthe entire process, the user does not need to manually adjust theplaying brightness of devices such as a VR device, to meet therequirements of automation and intelligentization of the user.

In an embodiment, an intelligent terminal is further provided. Aninternal structure of the intelligent terminal may be shown in FIG. 9.The intelligent terminal includes a video processing apparatus. Thevideo processing apparatus includes modules, and each module may becompletely or partially implemented by software, hardware, or acombination thereof.

A video processing apparatus and an intelligent terminal according tothe embodiments of this application are described in detail below.

FIG. 7 is a schematic structural diagram of a video processing apparatusaccording to an embodiment of this application. The apparatus in thisembodiment of this application may be disposed on a terminal forpanoramic play, such as a VR terminal. The apparatus includes thefollowing modules:

a determining module 701, configured to determine a second initial videoframe of a second time point in response to playing a panoramic video,the second time point being after a first time point;

a generation module 702, configured to generate a transitional initialvideo frame from a first initial video frame of the first time point toa second initial video frame, a color value of a pixel of a targetlocation in the transitional initial video frame being located within atarget interval, and the target interval being a color value intervaldetermined according to a color value of a pixel of the target locationin the first initial video frame and a color value of pixel of thetarget location in the second initial video frame; and

a control module 703, configured to perform play control according tothe transitional initial video frame, to complete playing the panoramicvideo.

The modules can be implemented by circuitry for example.

In an embodiment, the determining module 701 is configured to: obtain amotion speed value; obtain second pose information at the second timepoint according to first pose information of the first time point, themotion speed value, and duration between the first time point and thesecond time point; obtain a panoramic video frame of the panoramic videoto be played at the second time point; and obtain the second initialvideo frame from the panoramic video frame according to the second poseinformation.

In an embodiment, a frame width of the second initial video frameobtained from the panoramic video frame is greater than a frame width ofa play video frame that can be actually played and displayed on adisplay interface; and/or a frame height of the second initial videoframe obtained from the panoramic video frame is greater than a frameheight of the play video frame that can be actually played and displayedon the display interface; the play video frame that can be actuallyplayed and displayed on the display interface is a video frame formed bya partial image that is in the panoramic video frame and that isprojected onto the terminal display interface.

In an embodiment, the motion speed value includes rotation angularvelocity values in at least two different motion directions, the secondpose information is visual field orientation angle information; eachmotion direction is associated with an initial video frame group, andeach initial video frame group includes a second initial video frame anda transitional initial video frame obtained according to the secondinitial video frame.

In an embodiment, the generation module 702 is further configured to:obtain a video frame refresh frequency of the terminal; obtain a videoframe play frequency during playing of the panoramic video; determine anumber of transitional initial video frames according to the video framerefresh frequency and the video frame play frequency, a number of thegenerated transitional initial video frames being the same as thedetermined number.

In an embodiment, when configured to generate the transitional initialvideo frame from the first initial video frame of the first time pointto the second initial video frame, the generation module 702 isconfigured to: obtain a first color value of the pixel of the targetlocation in the first initial video frame of the first time point;obtain a second color value of the pixel of the target location in thesecond initial video frame; perform interpolation according to the firstcolor value and the second color value, to obtain the color value of thetransitional pixel of the target location of the transitional initialvideo frame; and after obtaining color values of a plurality oftransitional pixels, generate the transitional initial video frameaccording to the obtained pixel values of all the transitional pixels.

In an embodiment, when configured to perform interpolation according tothe first color value and the second color value, to obtain the colorvalue of the transitional pixel of the target location of thetransitional initial video frame, the generation module 702 isconfigured to: perform differential calculation on a color channelcomponent in the first color value and a color channel component in thesecond color value, to obtain a component difference of each colorchannel component; obtain a change amount of each color channelcomponent according to a number of transitional initial video framesthat need to be generated; and obtain the color value of thetransitional pixel of the target location of each transitional initialvideo frame according to each color channel component of the first colorvalue, the number of the transitional initial video frames that need tobe generated, and the change amount of each color channel component.

In an embodiment, pixel locations of all pixels of the first initialvideo frame and the second initial video frame are separately used asthe target location; or, the color value of the pixel of the targetlocation in the first initial video frame is black, and the color valueof the pixel of the target location in the second initial video frame iswhite; or the color value of the pixel of the target location in thefirst initial video frame is white, and the color value of the pixel ofthe target location in the second initial video frame is black.

In an embodiment, the control module 703 is configured to: detect amotion direction; select an initial video frame group associated withthe motion direction, to obtain a target initial video frame group; andperform play control according to the target initial video frame group.

In an embodiment, when configured to perform play control according tothe target initial video frame group, the control module 703 isconfigured to: obtain a visual field angle parameter of the terminal;determine, according to the visual field angle parameter, a pixel rangeprojected onto the terminal display interface; perform pixel coordinatematching on each initial video frame in the target initial video framegroup according to the pixel range, to form a transitional play videoframe and a second play video frame; and sequentially play thetransitional play video frame and the second play video frame accordingto a time order.

In this embodiment of this application, reference may be made todescriptions of related content in the foregoing embodiments forimplementation of each functional module in the apparatus.

In this embodiment of this application, a second time point can bedetermined through prediction, and some transitional video frames aregenerated according to the video frame of the second time point and thevideo frame of the current first time point, so that it can be ensuredthat a degree of a pixel color change between the video frame played atthe first time point and the video frame played at the second time pointis relatively small, and bad stimulation caused to eyes of the userduring playing of the panoramic video can be effectively reduced, and inthe entire process, the user does not need to manually adjust theplaying brightness of devices such as a VR device, to meet therequirements of automation and intelligentization of the user.

FIG. 8 is a schematic structural diagram of an intelligent terminalaccording to an embodiment of this application. The terminal in thisembodiment of this application may be a terminal that can play apanoramic video, such as a VR device. The intelligent terminal includesstructures such as a power supply module. In this embodiment of thisapplication, a processor 801, a storage apparatus 802, and a datainterface 803 are further included. A data connection is establishedbetween the processor 801, the storage apparatus 802, and the datainterface 803 in manners such as a bus.

The data interface 803 is configured to: receive external data, and sendthe external data to the processor 801 or store the external data intothe storage apparatus 802. The storage apparatus 802 may include avolatile memory, for example, a random-access memory (RAM). The storageapparatus 802 may also include a non-volatile memory, for example, aflash memory, a solid-state drive (SSD), and the like. The storageapparatus 802 may further include a combination of the foregoing typesof memories.

The processor 801 may be circuitry such as central processing unit (CPU)801. The processor 801 may further include a hardware chip. The hardwarechip may be an application-specific integrated circuit (ASIC), aprogrammable logic device (PLD), and the like. The PLD may be afield-programmable gate array (FPGA), a generic array logic (GAL), orthe like.

Optionally, the storage apparatus 802 is further configured to store aprogram instruction. The processor 801 may invoke the programinstruction, to implement the method mentioned in this embodiment ofthis application.

In an embodiment, the processor 801 invokes the program instruction for:determining a second initial video frame of a second time point inresponse to playing a panoramic video, the second time point being aftera first time point; generating a transitional initial video frame from afirst initial video frame of the first time point to a second initialvideo frame, a color value of a pixel of a target location in thetransitional initial video frame being located within a target interval,and the target interval being a color value interval determinedaccording to a color value of a pixel of the target location in thefirst initial video frame and a color value of pixel of the targetlocation in the second initial video frame; and performing play controlaccording to the transitional initial video frame, to complete playingthe panoramic video.

In an embodiment, when configured to determine the second initial videoframe of the second time point, the processor 801 is configured to:obtain a motion speed value; obtain second pose information at thesecond time point according to first pose information of the first timepoint, the motion speed value, and duration between the first time pointand the second time point; obtain a panoramic video frame of thepanoramic video to be played at the second time point; and obtain thesecond initial video frame from the panoramic video frame according tothe second pose information.

In an embodiment, a frame width of the second initial video frameobtained from the panoramic video frame is greater than a frame width ofa play video frame that can be actually played and displayed on adisplay interface; and/or a frame height of the second initial videoframe obtained from the panoramic video frame is greater than a frameheight of the play video frame that can be actually played and displayedon the display interface; the play video frame that can be actuallyplayed and displayed on the display interface is a video frame formed bya partial image that is in the panoramic video frame and that isprojected onto the terminal display interface.

In an embodiment, the motion speed value includes rotation angularvelocity values in at least two different motion directions, the secondpose information is visual field orientation angle information; eachmotion direction is associated with an initial video frame group, andeach initial video frame group includes a second initial video frame anda transitional initial video frame obtained according to the secondinitial video frame.

In an embodiment, the processor 801 is further configured to: obtain avideo frame refresh frequency of the terminal; obtain a video frame playfrequency during playing of the panoramic video; determine a number oftransitional initial video frames according to the video frame refreshfrequency and the video frame play frequency, a number of the generatedtransitional initial video frames being the same as the determinednumber.

In an embodiment, when configured to generate the transitional initialvideo frame from the first initial video frame of the first time pointto the second initial video frame, the processor 801 is configured to:obtain a first color value of the pixel of the target location in thefirst initial video frame of the first time point; obtain a second colorvalue of the pixel of the target location in the second initial videoframe; perform interpolation according to the first color value and thesecond color value, to obtain the color value of the transitional pixelof the target location of the transitional initial video frame; andafter obtaining color values of a plurality of transitional pixels,generate the transitional initial video frame according to the obtainedpixel values of all the transitional pixels.

In an embodiment, when configured to perform interpolation according tothe first color value and the second color value, to obtain the colorvalue of the transitional pixel of the target location of thetransitional initial video frame, the processor 801 is configured to:perform differential calculation on a color channel component in thefirst color value and a color channel component in the second colorvalue, to obtain a component difference of each color channel component;obtain a change amount of each color channel component according to anumber of transitional initial video frames that need to be generated;and obtain the color value of the transitional pixel of the targetlocation of each transitional initial video frame according to eachcolor channel component of the first color value, the number of thetransitional initial video frames that need to be generated, and thechange amount of each color channel component.

In an embodiment, pixel locations of all pixels of the first initialvideo frame and the second initial video frame are separately used asthe target location; or, the color value of the pixel of the targetlocation in the first initial video frame is black, and the color valueof the pixel of the target location in the second initial video frame iswhite; or the color value of the pixel of the target location in thefirst initial video frame is white, and the color value of the pixel ofthe target location in the second initial video frame is black.

In an embodiment, when configured to perform play control according tothe transitional initial video frame, the processor 801 is configuredto: detect a motion direction; select an initial video frame groupassociated with the motion direction, to obtain a target initial videoframe group; and perform play control according to the target initialvideo frame group.

In an embodiment, when configured to perform play control according tothe target initial video frame group, the processor 801 is configuredto: obtain a visual field angle parameter of the terminal; determine,according to the visual field angle parameter, a pixel range projectedonto the terminal display interface; perform pixel coordinate matchingon each initial video frame in the target initial video frame groupaccording to the pixel range, to form a transitional play video frameand a second play video frame; and sequentially play the transitionalplay video frame and the second play video frame according to a timeorder.

In this embodiment of this application, reference may be made todescriptions of related content in the foregoing embodiments forimplementation of each function of the processor 801.

In this embodiment of this application, a second time point can bedetermined through prediction, and some transitional video frames aregenerated according to the video frame of the second time point and thevideo frame of the current first time point, so that it can be ensuredthat a degree of a pixel color change between the video frame played atthe first time point and the video frame played at the second time pointis relatively small, and bad stimulation caused to eyes of the userduring playing of the panoramic video can be effectively reduced, and inthe entire process, the user does not need to manually adjust theplaying brightness of devices such as a VR device, to meet therequirements of automation and intelligentization of the user.

In an embodiment, FIG. 9 provides a diagram of an internal structure ofan intelligent terminal. The terminal includes a processor, a memory, anetwork interface, an input apparatus, and a display that are connectedvia a system bus. The memory includes a non-volatile storage medium andan internal memory. The non-volatile storage medium of the computerdevice stores an operating system, and may further storecomputer-readable instructions. The computer-readable instructions, whenexecuted by the processor, may cause the processor to implement a videoprocessing method. The internal memory may also store computer-readableinstructions. The computer-readable instructions, when executed by theprocessor, may cause the processor to implement a video processingmethod. The display of the computer device may be a liquid crystaldisplay or an e-ink display. The input apparatus of the computer devicemay be a touch layer covering the display, or may be a button, atrackball, or a touchpad disposed on a housing of the computer device,or may be an external keyboard, touchpad, a mouse or the like. A personskilled in the art may understand that, the structure shown in FIG. 9 ismerely an exemplary block diagram of a structure of a part related to asolution of this application, and does not constitute any limitation onthe server to which the solution of this application is applied. Aspecific server may include more or fewer components than those shown inthe figure, or some components may be combined, or a different componentdeployment may be used.

It should be understood that the steps in the embodiments of thisapplication are not necessarily performed in an order indicated by thestep numbers. Unless explicitly stated in this specification, theexecution of these steps is not strictly sequential, and these steps maybe executed in another sequence. Moreover, at least some of the steps ineach embodiment may include a plurality of sub-steps or stages, whichmay not necessarily be completed at the same moment, but may beperformed at different moments. These sub-steps or stages are notnecessarily performed in sequence, but may be performed in turn oralternately with at least some of other steps or sub-steps or stages ofthe other steps.

A person of ordinary skill in the art may understand that some or allprocedures in the method in the foregoing embodiments may be implementedby a computer-readable instruction instructing related hardware, theprogram may be stored in a non-transitory or non-volatile computerreadable storage medium, and when the program is executed, theprocedures in the foregoing method embodiments may be implemented. Anyreference to a memory, storage, database or other medium used in thevarious embodiments provided herein may include non-volatile and/orvolatile memory. The non-volatile memory may include a read-only memory(ROM), a programmable ROM (PROM), an electrically programmable ROM(EPROM), an electrically erasable programmable ROM (EEPROM), or a flashmemory. The volatile memory may include a random access memory (RAM) oran external cache. By way of illustration and not limitation, the RAM isavailable in various forms, such as a static RAM (SRAM), a dynamic RAM(DRAM), a synchronous DRAM (SDRAM), a double data rate SDRAM (DDRSDRAM),an enhanced SDRAM (ESDRAM), a synchronization link (Synchlink) DRAM(SLDRAM), a rambus (Rambus) direct RAM (RDRAM), a direct rambus dynamicRAM (DRDRAM), and a rambus dynamic RAM (RDRAM).

The foregoing disclosure is merely some embodiments of this application,and certainly is not intended to limit the protection scope of thisapplication. A person of ordinary skill in the art can understand all orsome flows of the foregoing embodiments, and equivalent variations madein accordance with the claims of this application shall fall within thescope of this application.

What is claimed is:
 1. A video processing method, comprising:determining, by circuitry of a terminal, a second video frame portionassociated with a second time point that is after a first time pointassociated with a first video frame portion of a video; generating atransitional video frame based on differences between first color valuesof pixels in the first video frame portion and second color values ofpixels in the second video frame portion, a transitional color value ofa pixel at a target pixel location in the transitional video frame beingwithin a target color interval, and the target color interval beingdetermined according to a first color value of the first color valuesthat corresponds to the pixel at the target pixel location in the firstvideo frame portion and a second color value of the second color valuesthat corresponds to the pixel at the target pixel location in the secondvideo frame portion; and performing, by the circuitry, display controlof the video according to the transitional video frame, wherein thefirst color value that corresponds to the pixel at the target pixellocation in the first video frame portion is one of black and white, andthe second color value that corresponds to the pixel at the target pixellocation in the second video frame portion is the other one of black andwhite.
 2. The method according to claim 1, wherein the determiningcomprises: obtaining, by the circuitry, motion speed information;obtaining, by the circuitry, second pose information of the second timepoint according to first pose information of the first time point, themotion speed information, and a difference between the first time pointand the second time point; obtaining, by the circuitry, a second videoframe of the video to be displayed at the second time point; andobtaining, by the circuitry, the second video frame portion from thesecond video frame according to the second pose information.
 3. Themethod according to claim 2, wherein a width of the second video frameportion obtained from the second video frame is greater than a width ofa video image that is displayed; and the displayed video image is apartial image of the second video frame.
 4. The method according toclaim 3, wherein a height of the second video frame portion obtainedfrom the second video frame is greater than a height of the displayedvideo image.
 5. The method according to claim 2, wherein the motionspeed information indicates rotation angular velocity values in at leasttwo different motion directions, the second pose information is visualfield orientation angle information, and a plurality of different motiondirections is associated with initial video frame groups, and eachinitial video frame group includes a candidate second video frameportion and a candidate transitional video frame generated based on thesecond video frame portion.
 6. The method according to claim 1, furthercomprising: obtaining, by the circuitry, a video frame refresh frequencyof the terminal; obtaining, by the circuitry, a video frame playfrequency during playing of the video; and determining, by thecircuitry, a number of transitional video frames according to the videoframe refresh frequency and the video frame play frequency, wherein thegenerating includes generating the determined number of transitionalvideo frames.
 7. The method according to claim 1, wherein the generatingcomprises: obtaining, by the circuitry, the first color value of thepixel of the target pixel location in the first video frame portion;obtaining, by the circuitry, the second color value of the pixel of thetarget pixel location in the second video frame portion; obtaining, bythe circuitry, the transitional color value of the transitional pixel atthe target pixel location based on an interpolation according to thefirst color value and the second color value; and generating, by thecircuitry, the transitional video frame according to pixel values of aplurality of transitional pixels, the plurality of transitional pixelsincluding the transitional pixel.
 8. The method according to claim 7,wherein the generating the transitional video frame comprises:performing, by the circuitry, differential calculation between colorchannel components in the first color value and color channel componentsin the second color value, to obtain component differences of the colorchannel components; obtaining, by the circuitry, change amounts of thecolor channel components according to a number of transitional videoframes that need to be generated; and obtaining, by the circuitry, thetransitional color value of the transitional pixel of the target pixellocation of each of the number of transitional video frames according tothe color channel components of the first color value, the number of thetransitional video frames that need to be generated, and the changeamounts of the color channel components.
 9. The method according toclaim 7, wherein the generating comprises: generating the transitionalvideo frame based on all pixels of the first video frame portion and allpixels of the second video frame portion.
 10. The method according toclaim 5, wherein the performing comprises: detecting, by the circuitry,a motion direction; selecting, by the circuitry, an initial video framegroup associated with the motion direction, to obtain a target initialvideo frame group; and performing, by the circuitry, the display controlof the video according to the target initial video frame group.
 11. Themethod according to claim 10, wherein the performing comprises:obtaining, by the circuitry, a visual field angle parameter of theterminal; determining, by the circuitry according to the visual fieldangle parameter, a pixel range to be displayed; performing, by thecircuitry, pixel coordinate matching on each video frame portion in thetarget initial video frame group according to the pixel range, to form atransitional play video frame and a second play video frame; andsequentially playing, by the terminal, the transitional play video frameand the second play video frame.
 12. A computer device, comprising:circuitry configured to determine a second video frame portionassociated with a second time point that is after a first time pointassociated with a first video frame portion of a video; generate atransitional video frame based on differences between first color valuesof pixels in the first video frame portion and second color values ofpixels in the second video frame portion, a transitional color value ofa pixel at a target pixel location in the transitional video frame beingwithin a target color interval, and the target color interval beingdetermined according to a first color value of the first color valuesthat corresponds to the pixel at the target pixel location in the firstvideo frame portion and a second color value of the second color valuesthat corresponds to the pixel at the target pixel location in the secondvideo frame portion; and perform display control of the video accordingto the transitional video frame wherein the first color value thatcorresponds to the pixel at the target pixel location in the first videoframe portion is one of black and white, and the second color value thatcorresponds to the pixel at the target pixel location in the secondvideo frame portion is the other one of black and white.
 13. Thecomputer device according to claim 12, wherein the circuitry isconfigured to obtain motion speed information; obtain second poseinformation of the second time point according to first pose informationof the first time point, the motion speed information, and a differencebetween the first time point and the second time point; obtain a secondvideo frame of the video to be displayed at the second time point; andobtain the second video frame portion from the second video frameaccording to the second pose information.
 14. The computer deviceaccording to claim 13, wherein a width of the second video frame portionobtained from the second video frame is greater than a width of a videoimage that is displayed; and the displayed video image is a partialimage of the second video frame.
 15. The computer device according toclaim 14, wherein a height of the second video frame portion obtainedfrom the second video frame is greater than a height of the displayedvideo image.
 16. The computer device according to claim 13, wherein themotion speed information indicates rotation angular velocity values inat least two different motion directions, the second pose information isvisual field orientation angle information, and a plurality of differentmotion directions is associated with initial video frame groups, andeach initial video frame group includes a candidate second video frameportion and a candidate transitional video frame generated based on thesecond video frame portion.
 17. The computer device according to claim12, wherein the circuitry is configured to obtain a video frame refreshfrequency of the computer device; obtain a video frame play frequencyduring playing of the video; determine a number of transitional videoframes according to the video frame refresh frequency and the videoframe play frequency; and generate the determined number of transitionalvideo frames.
 18. The computer device according to claim 12, wherein thecircuitry is configured to obtain the first color value of the pixel ofthe target pixel location in the first video frame portion; obtain thesecond color value of the pixel of the target pixel location in thesecond video frame portion; obtain the transitional color value of thetransitional pixel at the target pixel location based on aninterpolation according to the first color value and the second colorvalue; and generate the transitional video frame according to pixelvalues of a plurality of transitional pixels, the plurality oftransitional pixels including the transitional pixel.
 19. At least onenon-transitory computer-readable storage medium storing instructionswhich when executed by one or more processors cause the one or moreprocessors to perform: determining a second video frame portionassociated with a second time point that is after a first time pointassociated with a first video frame portion of a video; generating atransitional video frame based on differences between first color valuesof pixels in the first video frame portion and second color values ofpixels in the second video frame portion, a transitional color value ofa pixel at a target pixel location in the transitional video frame beingwithin a target color interval, and the target color interval beingdetermined according to a first color value of the first color valuesthat corresponds to the pixel at the target pixel location in the firstvideo frame portion and a second color value of the second color valuesthat corresponds to the pixel at the target pixel location in the secondvideo frame portion; and performing display control of the videoaccording to the transitional video frame, wherein the first color valuethat corresponds to the pixel at the target pixel location in the firstvideo frame portion is one of black and white, and the second colorvalue that corresponds to the pixel at the target pixel location in thesecond video frame portion is the other one of black and white.
 20. Themethod according to claim 1, wherein the video is a virtual realityvideo, and the first video frame portion and the second video frameportion correspond to different viewpoints within the virtual realityvideo.